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Abstract 

Provided is a similarity search method that makes use of a localized distance 
metric. The data includes a collection of items, wherein each item is associated with a set 
of properties. The distance between two items is defined in terms of the number of items 
5 in the collection that are associated with the set of properties common to the two items. A 
query is generally composed of a set of properties. The distance between a query and an 
item is defined in terms of the number of items in the collection that are associated with 
the set of properties common to the query and the item. The properties can be of various 
types, such as binary, partially ordered, or numeric. The distance metric may be applied 
10 explicitly or implicitly for similarity search. One embodiment of this invention uses 
'^l random walks such that the similarity search can be performed exactly or approximately, 

nJ trading-off between accuracy and performance. The distance metric of the present 

t^h invention can also be the basis for matching and clustering applications. In these 

f^^ contexts, the distance metric of the present invention may be used to build a graph, to 

s 15 which matching or clustering algorithms can be applied. 
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